Job-Length Estimation and Performance in Backfilling Schedulers

نویسندگان

  • Dmitry N. Zotkin
  • Peter J. Keleher
چکیده

Backklling is a simple and eeective way of improving the utilization of space-sharing schedulers. Simple rst-come-rst-served approaches are ineeective because large jobs can fragment the available resources. Backklling schedulers address this problem by allowing jobs to move ahead in the queue, provided that they will not delay subsequent jobs. Previous research has shown that inaccurate estimates of execution times can lead to better backklling schedules. We characterize this eeect on several work-loads, and show that average slowdowns can be eeec-tively reduced by systematically lengthening estimated execution times. Further, we show that the average job slowdown metric can be addressed directly by sorting jobs by increasing execution time. Finally, we modify our sorting scheduler to ensure that incoming jobs can be given hard guarantees. The resulting scheduler guarantees to avoid starvation, and performs signiicantly better than previous backklling schedulers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Session-Based, Estimation-less, and Information-less Runtime Prediction Algorithms for Parallel and Grid Job Scheduling

The default setting of most production parallel job schedulers is FCFS with backfilling. Under this setting, users must supply job runtime estimates, which are known as being highly inaccurate and inferior to system generated predictions. Recent research revealed how to utilize system predictions for backfilling, and this potential performance gain motivates searching for better prediction tech...

متن کامل

Power-aware Resource Allocation via Online Simulation with Multiple-queue Backfilling

Although traditional scheduling policies for high-end parallel systems focus on minimizing average job wait time while maximizing system utilization, actual supercomputer workload traces confirm the existence of significant periods of time of low utilization. Previous work has shown that, in the context of backfilling schedulers, portions of such high-end systems can be selectively powered down...

متن کامل

Self-Adapting Backfilling Scheduling for Parallel Systems

We focus on non-FCFS job scheduling policies for parallel systems that allow jobs to backfill, i.e., to move ahead in the queue, given that they do not delay certain previously submitted jobs. Consistent with commercial schedulers that maintain multiple queues where jobs are assigned according to the user-estimated duration, we propose a self-adapting backfilling policy that maintains multiple ...

متن کامل

Job scheduling with adjusted runtime estimates on production supercomputers

The estimate of a parallel job’s running time (walltime) is an important attribute used by resource managers and job schedulers in various scenarios, such as backfilling and short-job-first scheduling. This value is provided by the user, however, and has been repeatedly shown to be inaccurate. We studied the workload characteristic based on a large amount of historical data (over 275,000 jobs i...

متن کامل

Backfilling Using Runtime Predictions Rather Than User Estimates

The most commonly used scheduling algorithm for parallel supercomputers is FCFS with backfilling, as originally introduced in the EASY scheduler. Backfilling means that short jobs are allowed to run ahead of their time provided they do not delay previously queued jobs (or at least the first queued job). To make such determinations possible, users are required to provide estimates of how long jo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999